Multi-space random mapping for speaker identification

نویسندگان

Chong Tze Yuang

Andrew Beng Jin Teoh

David Chek Ling Ngo

Michael Goh Kah Ong

چکیده

This paper presents a work of utilizing multi-space random mapping (MRM) to formulate a dual-factor identification system, which combines speaker biometric and personal token. Personal token will be assigned to the client to constitute a unique random subspace during enrollment and the speaker template will be generated within the random subspace. Test features will be mapped to the random subspace that is described by the personal token presented during identification. Our work has shown that MRM-system exhibits stronger discriminative ability when comparing test features to its counterfeit templates, which lied in other different random subspaces. This advantage thus contributes to better F-ratio and greater accuracy recognition. Experiments on YOHO corpus demonstrate a remarkable result where the system achieves the perfect identification rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution

In our previous work, we have proposed a speaker modeling technique using spectral and pitch features for text-independent speaker identification based on Multi-Space Probability Distribution Gaussian Mixture Models (MSD-GMMs). We have presented a maximum likelihood (ML) estimation procedure for the MSD-GMM parameters and demonstrated its high recognition performance. In this paper, we describe...

متن کامل

Speaker identification with dual penalized logistic regression machine

This paper proposes a novel speaker identification method based on the dual Penalized Logistic Regression Machine (dPLRM) for general multi-class discrimination. The machine employs kernel functions which implicitly map an acoustic feature space to a higher dimensional space. Each speaker is discriminatively identified in this space implicitly. The penalized logistic regression model used in dP...

متن کامل

Speaker identification using Gaussian mixture models based on multi-space probability distribution

This paper presents a new approach to modeling speech spectra and pitch for text-independent speaker identification using Gaussian mixture models based on multi-space probability distribution (MSD-GMM). The MSD-GMM allows us to model continuous pitch values for voiced frames and discrete symbols representing unvoiced frames in a unified framework. Spectral and pitch features are jointly modeled...

متن کامل

Structured prediction for speaker identification in TV series

Though radio and TV broadcast are highly structured documents, state-of-the-art speaker identification algorithms do not take advantage of this information to improve prediction performance: speech turns are usually identified independently from each other, using unstructured multi-class classification approaches. In this work, we propose to address speaker identification as a sequence labeling...

متن کامل

Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification

Speakers are generally identified by using features derived from the Fourier transform magnitude. The Modified group delay feature(MODGDF) derived from the Fourier transform phase has been used effectively for speaker recognition in our previous efforts.Although the efficacy of the MODGDF as an alternative to the MFCC is yet to be established, it has been shown in our earlier work that composit...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

IEICE Electronic Express

دوره 2 شماره

صفحات -

تاریخ انتشار 2005

Multi-space random mapping for speaker identification

نویسندگان

چکیده

منابع مشابه

Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution

Speaker identification with dual penalized logistic regression machine

Speaker identification using Gaussian mixture models based on multi-space probability distribution

Structured prediction for speaker identification in TV series

Cluster and Intrinsic Dimensionality Analysis of the Modified Group Delay Feature for Speaker Classification

عنوان ژورنال:

اشتراک گذاری